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1.0 INTRODUCTION 

During the past quarter, the emphasis of the Skylab agricultural 
investigations has concentrated on the quantitative evaluation of the 
S190A and B photographic data from the Earth Resources Experimental 
Package (EREP) . Analysis of data from the SI 92 multispectral scanner 
is being delayed due to the lack of noise-free computer compatible 
tapes. The following sections of this quarterly progress reportdocu- 
ment those tasks which have been accomplished during this reporting 
period, and those tasks which are projected for the remainder of the 
contract performance period. 

2.0 WORK PERFORMED DURING THE PERIOD COVERED BY THIS REPORT 

2.1 Status of Skylab Data and High Altitude Aircraft Imagery 

2.1.1 Skylab 2 


All photography of our study areas that was acquired from Skylab 2 
(Track 63 , 3 June 1973) has been received along with its supporting 
high altitude aircraft imagery. During the June overpass, the S190B 
camera was not in operation; however, high quality S190A photography 
was acqui red . 

To date, only one S 1 92 digital tape (containing 5 seconds of data) 
has been received which covers the southern portion of the San Joaquin 
Valley test site (see Figure 1). Of the ten channels on this tape, four 
were considered unusable -- channels 1 and 13 were almost completely 
saturated, and a high degree of systematic noise was present in channels 
6 and 11. The remaining channels — 2, 3, 8, 9, 10 and 12 were 
usable for CALSCAN* analysis. Complete coverage of the test area with 


*CALSCAN is the RSRP version of the LARS-Purdue pattern recognition 
program adapted to the CDC 6600/7&00 system at the University of 
California, Berkeley. 
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Track 63 26 Jan Jk 

2 June 73 


Figure 1. The above map shows the state of California and the loca- 
tion of the agricultural test sites with respect to the ground tracks 
from the three Skylab missions. 



noise-free data ha* been ordered and is scheduled to be received in 
April 197 2 *. 

2.1.2 Sky lab 3 

One EREP pass along Track 59, 13 September 1973, was made during 
Sky 1 ab 3. Color and color infrared imagery from the S190A camera has 
been received along with color imagery from the S190B camera. Noise 
corrected data from the SI 92 scanner has been ordered and should be 
received in June 197^- 


2.1.3 Sky lab 4 

Two EREP passes were made along Track 63 on 3 December 1973 and 
on 26 January 197*». Data deliveries for S190A and S1908 cameras are 
expected in late spring and early summer; no delivery date has been 
qiven for the SI 92 scanner data. Because the Skylab vehicle returned 
to its original orbit during this mission, the amount of overlap cover- 
age with preceeding missions is limited, particularly for the SI 92 
mul t i spectral scanner. This overlap area is shown in figure I. 


2.2 Manual Crop Inventory 

To date, most of the identification and inventory of agricultural 
crops using satellite data (i.e., ERTS-1) have been done sem.- _ 
automatically by computer analysis of digital tapes with human inputs 
of training materials. Although such systems have achieved very 
accurate results in certain test areas, there is stin a need to 
develop techniques for manual interpretation of satellite data for 
agricultural resource inventories. While computer-based systems may 
ultimately provide the most efficient method for gathering agricultural 
statistics of extensive areas, at the present time the human interpreter 
represents the most expedient way to perform an operational inventory 
in the United States. For many of the emerging nations of the world 
where both national agricultural statistics and computer systems may 
be non-existent, the data gathered by human interpreters can provide a 
valuable input to the management decisions for agricultural resources. 


In any attempt to develop efficient techniques for the human inter 
pretation of Skylab data, several factors must be considered: U) 

because of the large areal coverage of the imagery, 100 percent image 
interpretation of the entire frame for detailed information is not 
practical, (2) a simple method is needed to evaluate the accuracy of 
the interpreter's estimates and, if necessary, to adjust these esti- 
mates, and (3) the low resolution of the satellite imagery makes 
accurate acreage estimates by human interpreters impractical. 


An inventory technique employing a double sample utilizing point 
data is being designed to deal effectively with these constraints. A 
single crop, namely cotton, has been selected on which these techniques 
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are being tested. For the first stage in the sample design (large 
sample), the interpreter will determine the presence or absence of 
cotton at a large number of points located throughout southwestern 
Fresno County. (This test area was chosen because of the a/a i labi I ity 
of sequential photographic coverage from SL-2 and SL-3) • These data 
will then be used to estimate by means of manual interpretation the 
proportion of the area that is planted to cotton. This proportion, 
when multiplied by the total area being inventoried will give an 
unbiased estimate of the total crop acreage. The second stage in the 
inventory will consist of a subsample of the large sample. This sub- 
sample will be drawn from those points from the large sample which fall 
within "ground truth" fields for which the crop type is known. The 
correlation between "ground truth" and image interpretation estimates 
will be used to evaluate interpreter accuracy and to calculate a ratio 
estimator to adjust the interpreter's estimated proportion. 

It should be noted that in operational surveys which would utilize 
real time satellite data, the method for selecting the subsamples would 
differ slightly. The ground samples would not be chosen until after 
the satellite data had been received and interpreted, thus eliminating 
the need for maintaining permanent ground data points. However, the 
technique as first described still has potential applications since 
many governmental agencies maintain some form of permanent ground plots 
for which accurate data are collected. This technique would allow the 
agencies to expand these data more accurately with the use of remotely 
sensed data. 

2 . 3 Automatic Interpretation 

The planned automatic data processing studies call for extracting 
agricultural information from EREP computer-compatible tapes and pho- 
tographic transparencies. The processing of both types of data will 
give a comparison of classification results using an analog storage 
medium (film) versus digital storage (tape). 

2.3-1 Mul t ispectral Scanner Data 

Initial experimentation with data from the S 1 92 mul t i spect ral 
scanner was begun using only six channels: 2, 3> 8, 9> 12 (see 

Section 2.1.1). Spectral data for seven crop types, viz., immature 
barley, mature barley, cucumber, plowed-fal low, tomato, cotton and 
alfalfa, were extracted from a section of the digital tape. From a 
statistical analysis of these data it was determined that channels 2, 

3, 8 and 12 only would be needed for the CALSCAN classifier because the 
data contained on channels 9 and 10 were highly correlated with those 
on channel 8 and therefore were considered redundant. 

Of the 3,500 acres (1,400 hectares) of cropland that were classi- 
fied, 75 percent of the fields were identified correctly. The visual 
results of a portion of this area are shown in Figures 2a and 2b. 
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Figure 2. The results of CALSCAN analysis of the SI 92 scanner data using a medium. (a) and a f ine 
(b) threshold. The area shown here is located west of Fresno, California. Data within the slack 
rectangular boxes were used for training the automatic classifier. See text for additional discus- 

s ion . 




Figure 2a was formed from a CALSCAN analysts which used a medium thresh- 
old decision rule and Figure 2b used a fine threshold. It is evident 
that even when using the medium threshold, a considerable amount of 
noise, as indicated by horizontal black lines, is still present on the 
tapes. In addition, extensive classification errors occurred, indi- 
cated by other color anomalies within fields, which was not due to the 
quality of the data per se, but to the immaturity of the crops at the 
time of the Sky lab overpass which makes crop discrimination difficult. 

An example of this confusion is shown in both Figures 2a and 2b for 
immature cotton (white) and immature cucumbers (purple). Despite the 
noise present on the data and the non-optimai acquisition time of the 
data, these results are very encouraging when one considers that 
neither channel b (green) nor channel 6 (red) was avai’able for use in 
the classifier. Once these two channels become available, we expect 
the classification accuracy to improve; considerably. 

2.3.2 Hul tispectral Camera Data 

All four bands of the SI 92 black-and-white photographic data 
obtained during the SL-2 mission were scanned with a microdens i tometer 
for an area covering the northern half of the Sal inas Val ley test site. 
The scan interval along both the X and Y axes was .01 inches, and the 
aperture of the scanner was set at .001 inch diameter 50 that each data 
point represented a 1.12 acre spot size on the gorund. The resulting 
density measurements were recorded on magnetic tape and were subsequently 
reformatted on the CDC 6600/7600 system so that they would be compatible 
with the CALSCAN program. Training fields were extracted from the 
reformatted tape, and the STAT subprogram* of CALSCAN was used to pro- 
duce various descriptive statistics. 

The mean density and standard deviation as scanned from the four 
bands for tomato, asparagus, sugar beets, lettuce, carrots, cauliflower 
and beans are shown in Table lj these spectral densities, as shown in 
Table 2, overlap in every band. It is evident from these statistics 
that data from one band of imagery alone will be insufficient to dis- 
criminate between these crops. Uniform appearance, i.e., small devia 
tlon from the mean, facilitates automatic classification. However, the 

types which display these small deviations, such as tomato and 
asparagus on the .8 - -9 pm band, also show a correspondingly small dif- 
ference between their actual means, and thus they have virtually 
identical density distributions. Accurate classification of these 
vegetable crops using either automatic or manual techniques will be 
difficult because of the similar states of development of these crops 


*The STAT subprogram of CALSCAN performs statistical analyses on speci- 
fied training fields and prints the results in various forms as 
directed by the user. These data indicate which classes are separable 
on the basis on training data given and also which features offer the 
best separation. 
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TABLE 1. STATISTICS BY CROP TYPE 
DERIVED FROM SCANNED 
S190A PHOTOGRAPHIC DATA 



BAND 1 
.7 - .8 m 

BAND 2 
.8 - .9 pm 

BAND 5 
.6 - .7 Pm 

BAND 6 

.5 - .6 pm 

TOMATO 

X 

97.96 

129.13 

127.99 

136.22 


S .D. 

28.91 

11.97 

36.09 

40.13 

AS PARA G US 

X 

109.08 

126.31 

151.74 

126.64 


S.D. 

13-65 

7-38 

i 

28.56 

24.01 

SUGAR BEETS 

X 

196.92 

115.47 

146.36 

145.41 


S.D. 

i 

21 .00 

19.96 

37.67 

44.37 

LETTUCE 


131-48 

122.50 

121 .47 

119.17 



39.23 

23-37 

30.40 

27.31 

CARROTS 


154.30 

108.77 

1 12.92 

114.34 


S.D. 

33-24 

27.24 

33.38 

13.93 

CAULIFLOWER 

X 

j 

184.50 

1 18.29 

145.71 

144.06 


S.D. 

31-03 

10.42 

23.15 

32.19 

BEANS 

X 

90.05 

122.56 

145.32 

137.82 


S.D. 

20.88 

23.79 1 

37.65 

42.01 


Note: Shown here are the mean density (7) and standard deviation for 

the seven crop types present in the Salinas Valley as scanned from the 
four black-and-white EREP transparencies. These stat i sties wi 1 II be 
used in part to select the optimum bands for automatic classification 
of crop type using the CALSCAN program. 
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TABLE 2. DENSITY DISTRIBUTIONS BY CROP TYPE 
DERIVED FROM SCANNED S190A PHOTOGRAPHIC DATA 



.5 _ -6 Pm .6 - .7 P .7“. 8 pm .8 “ .3 pm 


CROPS 

1 = Tomato 

2 = Asparagus 

3 - Sugar Beets 

4 = Lettuce 

5 = Carrots 

6 = Caul i flower 

7 = Beans 

Note: The relative distribution of densities of these vegetable crops on all four S190A black-and- 

white transparencies taken in June 1973, indicate that (1) no one band alone will be sufficient to 
discriminate between the crops, and (2) even by using the data from all four bands, accurate crop 
discrimination may not be possible. See text for further discussion. 



at the time of image acquisition. In June in the Salinas Valley, 
tomato, carrots, lettuce and cauliflower were just emerging from the 
soil and had the same basic appearance as bare soil. Improvement in 
crop identification accuracy is expected from the SL-3 data because 
all crops will be fully mature. When SL-2 (June) and SL-3 (September) 
data taken of the same area are combined, variations in cropping 
cycles will greatly facilitate discrimination between crops that have 
similar spectral reflectance characteristics on any single date. 

3.0 WORK PLANNED FOR NEXT REPORTING PERIOD 

3.1 Introduction 


Due to the delays that are expected in receiving all S 1 92 data it 
is doubtful whether much additional work with these data will be 
accomplished during the final quarter of the contract performance period. 
However, complete analysis of all photographic products will be com- 
pleted within this time frame. 

3.2 Manual Interpretation 

3.2.1 Crop Inventory 

The double sample design as described in Section 2.2 will be com- 
pleted. The final results from this study will include acreage esti- 
mates, sampling error, and an overall evaluation of the technique. 

3.2.2 Irrigated Land Study 

There presently exists an important need on the part of the 
Department of Water Resources of the State of California for a periodic 
tabulation in any given year of the statewide acreage of agricultural 
land receiving irrigation. A study will be performed to investigate 
the extent to which this tabulation can be accomplished using two dates 
of EREP data and the appropriate sampling designs. This study will 
not be able to give a valid yearly statistic of the irrigated lands 
within the test area due to the fact that only two dates of imagery are 
expected to be available and imagery taken at other dates would be 
required. However, it will help determine (l) the feasibility of mak- 
ing such inventories, (2) the expected accuracies of such inventories, 

(3) whether similar techniques could be used on ERTS-I imagery, and (A) 
the probable cost for making such inventories on a statewide basis. 

3 . 3 Automatic Interpretation 

3-3.1 Mul t i spectral Scanner Data 

As stated In previous sections of this report, analysis of the 
Si 92 data has been delayed. The extent to which this analysis can be 
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completed by the end of the contract performance period will be con- 
tingent upon the timely receipt of noise-corrected data for all channels 
covering the entire test area from SL-2, -3 and -4. With these data we 
will be able to (1) determine the relative contribution each channel 
makes in the CALSCAN classifier, (2) modify existing software to enable 
the overlaying of ground Tracks 63 (SL-2) , S3 (SL-3) and 63 (SL-4) to 
facilitate multidate analysis, and (3) compare the spatial and spectral 
resolutions of Skylab and ERTS with respect to accuracy of crop identi- 
fication and costs of data processing. 

3.3.2 Mult ispectral Camera Data 

Black-and-white S190A transparencies from SL-3 have just recently 
been scanned, and the data are now being processed. We expect that 
the classification accuracy achieved with these data will be much more 
accurate than that from the SL -2 data, because the crops within the 
test area have matured and should have more unique spectral signatures 
than they displayed in the June pass of SL-2. 

3.4 Multistage Sampling of Agricultural Resources 

The culmination of our Skylab agricultural experiments will be the 
demonstration of agricultural survey techniques using a multistage 
sampling model. Through use of the discriminant analysis techniques 
described below, Skylab photo and scanner data will be combined with 
aerial photos and ground data to give an estimate of crop acreage within 
the areas common to Track 63 of Skylab 2 and Track 59 of Skylab 3* We 
have not yet chosen which crop or crops will be inventoried; this will 
be determined after the data from the S 1 9 2 mul t i spectra 1 scanner have 
been analyzed. Under normal operational conditions, the optimum times 
for obtaining remote sensing data for a successful agricultural 
inventory are known and are carefully adhered to. However, since this 
is not possible under the restrictions of the EREP missions, the crops 
to be surveyed will be those which can best be inventoried with the 
available Skylab data. 

The multistage model relies heavily upon the first stage in which 
the information extracted from the Skylab data by human interpreters 
and automatic classifiers provides the initial estimates of the resource. 
The first step of the data extraction process will be one in which 
human interpreters stratify all fields within the area of interest into 
broad land use categories and crop classes based on their appearance 
on the Skylab S190A ektachrome imagery. At this time political and 
geographic boundaries will be superimposed on the imagery to further 
define the areas of interest. Next, a number of fields which 
represent the various agricultural r esou rces of interest will be 
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selected from each stratum to train the discriminant analysis pro- 
gram. The identities of these fields will be determined from ground 
data and/or the interpretation of aerial photos. The number of train- 
ing fields required for each crop class will be dependent upon the 
variability of the spectral signatures of the crops present. This 
variability is caused by such factors as different cropping practices, 
local soil differences, and genetic variations within each particular 
crop type. For example, a crop such as alfalfa which may be in several 
stages of maturity throughout the survey area at the time of image 
acquisition may require five or more fields per stratum for adequate 
training, whereas only one training field per stratum may be needed for 
a less complex crop type such as corn. After the fields have been 
chosen, they will be located on, and extracted from, the spacecraft 
imagery. The multispectral data from the training fields will be run 
through the discriminant analysis program to obtain a point-by-point 
classification of the entire area by strata. This procedure will pro- 
vide an initial estimate of the acreage of the vegetation classes. 

In the second stage of the model, the results of the discriminant 
analysis will be sampled to determine their relationship to ground 
estimates of the resource. Sampling units (SU) will be defined by 
dividing the entire area into rectangular blocks. The size and shape 
of these blocks will be determined by (1) the information requirements, 
(2) the variability that occurs in the SU estimates as the block size 
changes, (3) the costs of further subsampling, and (*0 the resolution 
of the scanner imagery. 


Probability sampling is expected to provide the most efficient 
sampling design that can be applied in the second stage of the model. 
Probability sampling is a special case of the mean of the ratios esti 
mation where samples are allocated proportional to the expected va Tth 
lance of the X. estimate. For this model, the total value of the i 
SU, denoted by'x,. Is evaluated by 


X. 

1 


m=T j = 1 


I V. 
m j 


where 



M 

V. 

J 

J 


0 otherwise 

crop class for the n^ "pixel" (picture element) 
of the SU, as determined by the discriminant 
anal ys i s , 

the number of "pixels" per SU, 
the crop class being evaluated 

the number of crop classes. 
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The value or weight (v.) is assigned to rank the various crops or veg 
etation types based on J their relative importance . to the survey. In an 
agricultural inventory where total dollar value is the objective, the 
v 's are the average dollar values per "pixel" of the various crops 

(j). If the making of an inventory of a single crop is the objective, 
the value of the crop of interest is 1 and all other v. s bv ° the 

zero. In many cases this weighting factor is pr imar i 1 /affected by 
marketing conditions of each crop, and is highest ior those crops 
which acreage estimate errors are most Important* 

The variance of the population is estimated by 


s 2 - 


The number (n) of SU's 
ment when no remote sensing 


to be selected for photo and ground measure- 
information is available is determined by 


n 


2 2 

Nt S 

? 2 2 
N(AE) Z + tV 


where 


AE 

t 


the allowable error, in units of value 

is a value obtained from "students t" tables and 

is as defined previously. 


to 


The n points are then selected from the list of SU's proportional 
their estimated value. 


The 


selected SU's are then carefully transferred to the correspond- 
ing high flight photography where precise field size measurements are 
made for use later in adjusting the acreage estimates obtained from the 

classi f ier . 


From high fl ight images, 
tion and historical data, the 
in the SU is determined, down 


low altitudes images, ground identifica- 
"correct" classification for each field 
to crop type and maturity. 


The total value for the area (T) is estimated us i ng the probabi 1 i ty 
of selection (P.) and the photo/ground estimate of SU value IY.) by 
means of the relation. 
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T « 1 £ II 

1 n i=l P. 

i 


where P, = tt 
i N 


x. 

i 


ir- x. 

]=\ 


The variance of the estimate for T is 


s - 2 = Var(T) = ~ P, 
T n i = l 



A 

T) 


2 


If the photo/ground estimate (Y. ) were to be perfectly proportional 

to the remote sensing estimate (X.), only one ground sample would be 

needed to determine the proportionality constant. More realistically 
however, the number of ground samples (n) for future surveys is esti- 
mated by: 


n 


2 2 
Nt s‘ 

* 5 2 2 

N(AE) + t s" 


This probability sampling model is appropriate when a single 
parameter such as "acreage of a single crop", "value of all the crops 
present", or "demand for irrigation water" is desired. Such a model 
can be replaced by a regression sampling model if estimates on a crop- 
by-crop basis are required; however, the regression sampling model will 
only meet the allowable error criterion for the total value of all 
crops by strata. 
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